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Abstract 

The present study is part of a broader programme, exploring the possibility of in¬ 
volving the Microsoft Kinect™ sensor in the analysis of human motion. We examine 
the output obtained from the two available versions of this sensor in relation to the 
variability of the estimates of the lengths of eight bones belonging to the subject’s 
extremities: of the humerus (upper arm), ulna (lower arm, forearm), femur (upper 
leg), and tibia (lower leg, shank). Large systematic effects in the output of the two 
sensors have been observed. 
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1 Introduction 


In a previous study [1], we established the theoretical background needed 
for the comparison of the output of measurement systems used in capturing 
data for the analysis of human motion. We developed our methodology for 
a direct application in case of the two versions of the Microsoft Kinect™ 
(hereafter, simply ‘Kinect’) sensor [2], a low-cost, portable motion-sensing 
hardware device, developed by the Microsoft Corporation (Microsoft, USA) 
as an accessory to the Xbox 360 video-game console (2010). The sensor is a 
webcamera-type, add-on peripheral device, enabling the operation of Xbox 
via gestures and spoken commands. The first upgrade of the sensor (‘Kinect 
for Windows v2’), both hardware- and software-wise, tailored to the needs of 




Xbox One, became available for general development and use in July 2014. 
In Ref. [3], we applied our methodology [1] to a comparative study of the two 
Kinect sensors and drew attention to significant differences in their output. 

Previous attempts to validate the Kinect sensor for various medical/health- 
relating applications have been discussed in Ref. [1]. The present study is part 
of our research programme, investigating the possibility of involving (either 
of) the Kinect sensors in the analysis of the motion of subjects walking or run¬ 
ning ‘in place’, e.g., on commercially-available treadmills. If successful, Kinect 
could become an interesting alternative to marker-based systems (MBSs) in 
capturing data for motion analysis, one with an incontestably high benefit-to- 
cost ratio. 

Studied herein is the dependence of the evaluated lengths of eight bones of the 
subject’s extremities on the kinematical variables pertaining to the viewing of 
these bones by the sensor. The bones are: humerus (upper arm), ulna (lower 
arm, forearm), femur (upper leg), and tibia (lower leg, shank). The evaluated 
lengths of the subject’s left and the right extremities are separately analysed^ 

Ideally, the lengths of the bones of the subject’s extremities should come 
out constant, irrespective of the orientation of these bones in space and of 
the viewing angle by the sensor. In reality, a departure from constancy is 
inevitable, given that the Kinect nodes represent centroids in the probability 
maps obtained from the captured data in each frame separately [4]; as such, the 
node-extraction process is subject to statistical fluctuations and is affected by 
different systematic effects in the three spatial directions. In the present work, 
we first examine the variability of the evaluated bone lengths with the variation 
of two angles describing the orientation of the bones in space, to be denoted 
hereafter as 9 and 0; 9 is the angle of the bone with the vertical, whereas 
cj) is the angle between the bone projection on the (anatomical) transverse 
plane and the z axis (the direction associated with the depth in the images 
obtained with the sensors). Subsequently, we investigate the dependence of 
the evaluated lengths of these bones on the inclination angle with respect to 
Kinect’s viewing direction. We pursue the determination of systematic effects 
in the Kinect-captured data, placing emphasis on establishing the similarities 
and the differences in the behaviour of the two sensors; in this respect, the 
present paper constitutes another comparative study of the two Kinect sensors, 
albeit from a perspective different to the one of Refs. [1,3]. 

The material in the present paper has been organised in four sections. In 
Section 2, we provide the details needed in the evaluation and in the analysis 
of the bone lengths. The results of the study are contained in Section 3. We 
discuss the implications of our findings in the last section. 

1 The subject’s left and right parts refer to what the subject perceives as the left 
and right parts of his/her body. 
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2 On the evaluation and analysis of the bone lengths 


In the original sensor, the skeletal data (‘stick figure’) of the output comprises 
20 time series of three-dimensional (3D) vectors of spatial coordinates, i.e., 
estimates of the ( x,y,z ) coordinates of the 20 nodes which the sensor associates 
with the axial and appendicular parts of the human skeleton. While walking 
or running, the subject faces the Kinect sensor. In coronal (frontal) view of 
the subject (sensor view), the Kinect coordinate system is defined with the 
x axis (medial-lateral) pointing to the left (i.e., to the right part of the body 
of the subject being viewed), the y axis (vertical) upwards, and the z axis 
(anterior-posterior) away from the sensor, see Fig. 1. In the upgraded sensor, 
five new nodes have been appended at the end of the list: one is a body node, 
whereas the remaining nodes pertain to the subject’s hands. In both versions, 
parallel to the video image, Kinect captures an infrared image, which enables 
the extraction of information on the depth z. The sampling rate in the Kinect 
output (for the video and the skeletal data, in both versions of the sensor) is 
30 Hz. As the Kinect output has already been detailed twice, in Sections 2.1 
of Refs. [1,3], there is no need to further describe it here. 

The ‘upper’ endpoint (upper or proximal extremity) of each bone will be iden¬ 
tified by the subscript S, the ‘lower’ endpoint (lower or distal extremity) by 
E. The upper and lower endpoints of the bones refer to the upright (erect) 
standing (rest) position of the subject (standard anatomical position); in this 
position, y,s > yE in all cases. The four bones pertaining to the upper extrem¬ 
ities (arms) are defined on the basis of the Kinect nodes SHOULDER_LEFT, 
ELBOW_LEFT, and WRIST_LEFT (left side); SHOULDER_RIGHT, EL¬ 
BOW-RIGHT, and WRIST_RIGHT (right side). The four bones pertaining 
to the lower extremities (legs) are defined using the Kinect nodes HIP_LEFT, 
KNEE-LEFT, and ANKLE-LEFT (left side); HIP-RIGHT, KNEE-RIGHT, 
and ANKLE-RIGHT (right side). 

Denoting the bone length as L — \J{xs — Xe ) 2 + (ys ~ He ) 2 + {zs — z e) 2 , the 
angle 9 is estimated via the expression 


Vs - Ve = Ay = L cos 9 


( 1 ) 


Evidently, 6 G [0,7r]; in the standard anatomical position, 6 = 0° (ideally). 
Additionally, 

xs ~ xe = Aa: = L sin 9 sin 0 (2) 


and 


zs ~ ze = A z = L sin 9 cos (f) . (3) 

Obviously, 0 = 0° when the bone projection on the transverse plane (x,z) 
is antiparallel to the z axis (i.e., when the projection of the vector SE on 
that plane points in the direction of the sensor). Denoting the projected bone 
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length on the transverse plane as L xz = y J ( Ax ) 2 + (A^) 2 = L sin 9 , we obtain 
the relation: 

, Az 

cos <fi = —— (4) 

L'xz 

for all 6 values, exempting 6 = 0 and 9 = n where the projected length L xz 
vanishes; although cases in the data where 9 comes out identical to 0 or n have 
not been found, we will assign (for the sake of mathematical completeness) 
the value of 0 to 0 in both cases. Evidently, (p G [0, 2n). In the present study, 
we will assume left/right symmetry (in the description of the motion of the 
bones) and restrict the angle <p hi the [0,7r] domain. 

Of relevance in the context of the present work is the angle at which the 
Kinect sensor views a bone. The inclination angle is obtained from the data 
as follows. Referring to Fig. 1, we define the position vectors pertaining to the 
bone endpoints as fs = ( xs,Us, z s ) an d f E = (%e, Ve, ze)- The unit vector, 
normal to the KSE plane, is obtained via the expression: 


u N 


te x f s 
II r E x f 5 1| 


(5) 


(We have not found instances in the data involving parallel or antiparallel 
vectors rg and fg .) The position vector of the midpoint M of the SE segment 
is given by 


r, s + r E 

r M = --- 


( 6 ) 


and the unit vector along that direction by 


u R 


Tm 

lkM|| 


(7) 


The unit vector ii, on the KSE plane, orthogonal to Ur, is obtained by the 
expression 


u R x u N 

u = TE- = u R xu N 

|| Ur X Un || 


( 8 ) 


The inclination u is the angle between SE = f E — fs and u: evidently, 


cosca = 


(r E - f s ) • u 

\\rE-r s \\ 


(9) 


As we are interested in the inclination of each bone, not in its orientation with 
respect to the unit vector u, we will use |coscu| as free variable. It is expected 
that the reliability of the Kinect-evaluated bone lengths should increase with 
increasing |cosca|. 
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3 Results 


The data acquisition involved one male adult (ZHAW employee), with no 
known motion problems, walking and running on a commercially-available 
treadmill (Horizon Laufband Adventure 5 Plus, Johnson Health Tech. GmbH, 
Germany). The placement of the treadmill in the laboratory of the Institute of 
Mechatronic Systems (School of Engineering, ZHAW), where the data acqui¬ 
sition took place, was such that the subject’s motion be neither hindered nor 
influenced in any way by close-by objects. Prior to the data-acquisition ses¬ 
sions, the Kinect sensors were properly centred and aligned. The sensors were 
then left at the same position, untouched throughout the data acquisition. 

It is worth mentioning that, as we are interested in capturing the motion of 
the subject’s lower-leg parts (i.e., of the ankle and foot nodes), the Kinect 
sensors must be placed at such a height that the number of lost lower-leg 
signals be kept reasonably small. Our past experience dictated that the Kinect 
sensor should be placed close to the minimal height recommended by the 
manufacturer, namely around 2 ft off the (treadmill-belt) floor. Placing the 
sensor higher (e.g., around the midpoint of the recommended range of values, 
namely at 4 ft off the treadmill-belt floor) leads to many lost lower-leg signals 
(the ankle and foot nodes are not properly tracked), as the lower leg is not 
visible by the sensor during a sizeable fraction of the gait cycle, shortly after 
toe-off. 

The Kinect sensor may lose track of the lower parts of the subject’s extremities 
(wrists, hands, ankles, and feet) for two reasons: either due to the particular¬ 
ity of the motion of the extremity in relation to the position of the sensor 
(e.g., the identification of the elbows, wrists, and hands becomes problematic 
in some postures, where the Kinect viewing angle of the ulnar bone is small) 
or due to the obstruction of the extremities of the human body (behind the 
subject) for a fraction of the gait cycle. Assuming that these instances remain 
rare (e.g., below about 3% of the available data in each time series, i.e., no 
more than one lost frame in 30), the missing values may be reliably obtained 
(interpolated) from the well-determined (tracked) data. Although, when nor¬ 
malised to the total number of available values, the untracked signals usually 
appear ‘harmless’, particular attention was paid in order to ensure that no 
node be significantly affected. 

Five velocities were used in the data acquisition: walking-motion data were 
acquired at 5 km/h; running-motion data at 8, 9, 10, and 11 km/h. At each 
velocity setting, the subject was given 1 min to adjust his movements comfort¬ 
ably to the velocity of the treadmill belt. The Kinect output spanned slightly 
over 2 min at each velocity. The variation of the distance between the subject 
and the Kinect sensors was monitored during the data acquisition; it ranged 
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between about 2.5 and 2.9 m, well within the limits of use of the sensors set 
by the manufacturer. The recording on the two measurement systems started 
simultaneously. 

At each sampled frame, the bone lengths were calculated from the Kinect 
output and were histogrammed in (cos 9, cos </>) cells; the same values were 
also histogrammed in bins of |coso;|. Twenty bins per angular direction were 
used in the former case, forty in the latter. Averages, as well as the standard 
errors of the means, were calculated in all cells/bins containing at least ten 
entries; cells/bins with fewer entries were ignored. 

Techniques yielding accurate results for in-vivo measurements of the human 
long bones include conventional (planar) radiography, CT scanning (e.g., see 
Ref. [5]), Raman spectroscopy [6], and ultrasonic scanning [7]. Unwilling or 
unable to use any of these techniques, we obtained static measurements of 
the subject’s bone lengths with an MBS (AICON 3D Systems GmbH, Ger¬ 
many) [8]. Our system features two digital cameras, a control unit, and a 
high-end personal computer (where the visualisation software is installed). 
The Movelnspect Technology HF|HR reconstructs the 3D coordinates of the 
centres of markers (adhesive targets, reflective balls) which are simultaneously 
viewed by the two cameras; the typical uncertainty in the determination of the 
3D coordinates of each marker centre is below 100 /irn, i.e., negligible when 
compared to other uncertainties, namely to those pertaining to the placement 
of the markers and to skin motion. The bone lengths, not to be identified 
herein with the suprema of the distances of any two points belonging to the 
bones regarded as 3D objects, were defined as follows. 

• Humerus: from the centre of the humeral greater tubercle (‘coinciding’ with 
the Kinect shoulder node) to the humeral lateral epicondyle. As it is not 
straightforward to identify the centre of the humeral greater tubercles, flat 
markers were placed on the greater tuberosities and the resulting humeral- 
bone lengths were reduced Lll by 39 mm. 

• Ulna: from just below the humeral medial epicondyle, on top of the humeral 
trochlea, to the ulnar styloid process. 


2 The description of the algorithm, used in the determination of the 3D positions of 
the skeletal joints of the subject being viewed with Kinect, may be found in Ref. [4]. 
The ‘3D positions of the joints’ of Ref. [4] are essentially produced from the ‘3D 
positions of the projections of the joints onto the front part of the subject’s body’ 
after applying a ‘shift’ in depth (i.e., from the surface to the interior of the subject’s 
body), namely a constant offset (£ c ) of 39 mm (see end of Section 3 of Ref. [4]). 
We will assume that the same correction is also applied to the infrared image in 
the vertical (y) direction (and, albeit not relevant herein, also in the medial-lateral 
(x) direction), in order to yield the y (and x) coordinates of the shoulder joints. 
Therefore, when comparing it with the Kinect-evaluated humeral-bone length, the 
MBS-evaluated length will be reduced by ( c . 
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• Femur: from the centre of the femoral head to just below the femoral medial 
condyle, on top of the medial meniscus. As the identification of the cen¬ 
tres of the femoral heads, using a non-invasive anthropometric technique, 
is neither easy nor accurate, we obtained the hip positions by following the 
indirect approach of Ref. [9], featuring four flat markers on the surface of 
the subject’s body (see Subsection 2.2 of Ref. [1]). The knee positions were 
identified in two ways: either using 014-mm reflective balls on top of the 
medial menisci and applying a correction (lateral shift) similar to the one 
applied in Ref. [9] or using plat markers on the patellae (knee caps) and 
applying a correction in depth (half of the width of the subject’s leg, at the 
knee level, in sagittal view); the two results were found almost identical. 

• Tibia: from just below the femoral medial condyle, on top of the medial 
meniscus, to the medial malleolus. 

All measurements were acquired in the upright position. The extracted values 
of the subject’s bone lengths are listed in Table 1. 

The average values, along with the rms (root-mean-square) values of the cor¬ 
responding distributions, of the Kinect-evaluated bone lengths are shown in 
Table 2, separately for the two Kinect sensors. A noticeable difference be¬ 
tween the two sets of values occurs in the femoral-bone lengths, the values 
of which come out, in all cases, significantly smaller when using the data of 
the upgraded sensor; this underestimation is equivalent to an effect between 
1.69 and 2.42 standard deviations in the normal distribution. The quoted un¬ 
certainties in Table 2 are large, indicating that systematic effects come into 
play, thus suggesting further analysis of the bone lengths, in terms of the 
kinematical variables 6 , (j), and to. 


3.1 Profile scatter plots of the bone lengths in (cos 9 ,cos fi) cells 


Walking and running motions have different signatures; the differences are 
both quantitative and qualitative: the ranges of motion in running are larger, 
generally expected to increase with increasing velocity; qualitatively, the walk¬ 
ing motion is characterised by extended elbow joints, the running motion by 
flexed ones. This last dissimilarity affects the detection of the elbows and of 
the wrists at several postures in running motion, inevitably introducing sys¬ 
tematic effects in the evaluation of the humeral- and ulnar-bone lengths. 

Our first study of the systematic effects in the evaluation of the bone lengths 
involved profile scatter plots in (cos 6 ,cos <f >) cells. For convenience, the follow¬ 
ing definitions will be used in the short description of the results obtained in 
this part of the analysis. 

• Forward (ventral) placement of a bone corresponds to a position of its lower- 
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endpoint joint more proximal to the Kinect sensor than the mid-coronal 
plane; ‘very forward’ placement refers to cos 0 > 0.9. 

• Backward (dorsal) placement of a bone corresponds to a position of its 
lower-endpoint joint more distal to the Kinect sensor than the mid-coronal 
plane; ‘very backward’ placement refers to cos 0 < —0.9. 

In general, the walking motion is restricted to small 9 values; only in the case 
of the ulna does the motion extend to 9 « 60°. The humeral-bone lengths came 
out cos ^-dependent; systematically smaller values were extracted in forward 
placement, larger in backward. Regarding the ulna, the 9 domain is somewhat 
enlarged in very forward placement, when the elbow joint is (usually) flexed. 
Regarding the tibia, a significant dependence of the evaluated length on cos 0 
was seen in very backward placement; the effect was maximal around the 
most distal position of the ankle (with respect to the Kinect sensor), where 
the shank is not viewed sufficiently well by the sensor. 

Compared to walking motion, the domain of the 9 values is significantly en¬ 
hanced in running (as expected). The evaluated humeral-bone lengths were 
(again) found cos 0-dependent. Regarding the ulna, the cos0 values were large 
(restriction of the motion to very forward placement). For the tibial bones, the 
evaluated lengths in very backward placement were found to be significantly 
larger than those obtained elsewhere. Again, the largest bone lengths were 
extracted for large 9 and 0 values (large dorsal elevation of the lower leg), 
where the tibial bone is not viewed sufficiently well by the Kinect sensor. 

In all cases, the relative minimax variation d r = 2 (M — m)/(M + m) (where 
M and m stand for corresponding maximal and minimal values, respectively) 
of the extracted values of the bone lengths was large. The d r ranges (over all 
velocities) were: for the humerus, 3-22% (original sensor) and 4-20% (upgraded 
sensor); for the ulna, 7-29% (original sensor) and 8-35% (upgraded sensor); 
for the femur, 4-13% (original sensor) and 7-17% (upgraded sensor); and for 
the tibia, 18-24% (original sensor) and 24-26% (upgraded sensor). Overall, 
the matching of the results between the two sensors in the case of running 
motion was not satisfactory; the values of Pearson’s correlation coefficient (on 
the d r values of the eight bone lengths at fixed velocity) showed a pronounced 
velocity dependence, ranging from 0.958 at 5 km/h to 0.260 at 11 km/h. 


3.2 Profile histograms of the bone lengths in |coscu| bins 


The analysis described in the previous subsection is suggestive of the direction 
which the investigation must next turn to. In short, it appears plausible to 
assume that the goodness of the evaluation of the bone lengths does not depend 
separately on the angles 9 and 0, but on another quantity, namely on the 



viewing angle of the particular bone by the Kinect sensor. It is reasonable to 
expect that the accuracy in the evaluation of each bone length depends on the 
inclination of that bone with respect to the Kinect viewing direction; when a 
bone is viewed by the Kinect sensor at almost right angle, its length should 
be evaluated more reliably. The appropriate free variable in this investigation, 

|costa|, is obtained via Eq. (9). 

The results, obtained from the data analysis, are shown in Figs. 2, 4, 6, and 
8 for the original sensor; in Figs. 3, 5, 7, and 9 for the upgraded sensor. We 
now discuss these plots. 

• Humerus. There can be little doubt that both Kinect sensors systemati¬ 
cally underestimate the humeral-bone length regardless of the type of the 
motion (walking or running). Only in the case |cosca| —» 1 in the walking- 
motion data are the estimates close to the results of the static measure¬ 
ments. All humeral-bone lengths obtained for |coso;| —> 1 in the case of the 
running-motion data agree well, and are about 65-75 mm short of the results 
of the static measurements. This discrepancy is clue to the misplacement of 
the arm joints during running, in particular of the elbows, which are kept in 
flexed position throughout the gait cycle. Compared to the original sensor, 
the upgraded sensor generally yields slightly shorter humeral-bone lengths. 

• Ulna. In the data obtained with the original sensor, some left/right asym¬ 
metry in the results is visible. The range of variation of the ulnar-bone 
length was found maximal (about 10 cm) in the case of the original sensor, 
for the right ulna. Regarding the walking-motion data for |coscu| —> 1, the 
extracted values do not disagree with the results of the static measurements. 

• Femur. The femoral-bone length, obtained with the original sensor, is se¬ 
riously overestimated in all cases. The hip positions are mostly responsible 
for this discrepancy. (In Ref. [3], we reported that the waveforms for the 
hips, obtained from the two versions of the sensor, do not match well.) 

• Tibia. In all cases, both sensors seriously overestimate the tibial-bone lengths 
for small viewing angles and underestimate them for |costu| —> 1. The range 
of variation of the tibial-bone length was found sizeable, namely between 9 
and 11 cm. On the other hand, a nearly monotonic behaviour is observed 
in Figs. 8 and 9, and the results from the walking- and the running-motion 
data generally appear to be consistent between the two sensors. 

It must be mentioned that the dependence of the bone lengths L on |coscu| is 
expected to be monotonic. Visual inspection of Figs. 2-9 reveals that this is 
not always the case. One reason for the observed departure from a monotonic 
behaviour (yet not the only one) is that the inclination u is estimated from 
the Kinect-captured data; systematic effects also affect this estimation. 

The results, reported in the present section, have also been checked against 
influences from ‘cross-talk’, which might be present in the output of the two 
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Kinect sensors. Both sensors use reflected infrared light in order to yield in¬ 
formation on the depth in the captured images; one might thus argue that, in 
case of a simultaneous data acquisition, they distort one another’s recording. 
To clarify this issue, data (using the same subject and velocity settings) were 
acquired with the two measurement systems ‘serially’, first with the original 
sensor, subsequently with the upgraded one; in both cases, the second sensor 
was switched off (but was not removed from the mount). The differences to the 
results reported herein were inessential. Our conclusion is that the aforemen¬ 
tioned discrepancies cannot be due to an interaction between the two sensors. 


4 Conclusions 


The present paper addressed one use of the output of the two Microsoft 
Kinect™ (hereafter, simply ‘Kinect’) sensors, namely the evaluation of the 
lengths of eight bones pertaining to the extremities of one subject walking 
and running on a treadmill: of the humerus (upper arm), ulna (lower arm, 
forearm), femur (upper leg), and tibia (lower leg, shank). For comparison, 
static measurements of these bone lengths were obtained with a marker-based 
system. 

The evaluated lengths of the left and right parts of the subject’s extremities 
have been separately analysed. The constancy of the lengths of these eight 
bones in terms of the variation of the two angles involved in the viewing, 6 of 
Eq. (1) and (j) of Eq. (4), was examined. We have also investigated the depen¬ 
dence of the bone lengths on the inclination angle c o of Eq. (9) with respect to 
Kinect’s viewing direction. We pursued the analysis of systematic effects in the 
output, emphasising on the similarities and the differences between the two 
sensors; in this respect, the present study is another comparative study of the 
two Kinect sensors, albeit from a perspective different to that of Refs. [1,3]. 

The walking motion is characterised by extended elbow joints and small 6 
values in the case of the humerus and of the femur; in the case of the ulna, 
the 6 values reached about 60°. The running motion is characterised by flexed 
elbow joints and (compared to the walking motion) extends more in 6. The 
motion of the ulnar bones is different in walking and running; in the latter 
case, the ulnar motion extends in 6 , rather than in (f). A major overestimation 
of the tibial-bone lengths occurs in very backward placement, where these 
bones cannot be viewed sufficiently well by the Kinect sensors. 

The analysis of the arm-bone lengths in terms of the inclination angle oj 
demonstrated that the results obtained with both sensors do not agree well 
with those of the static measurements; agreement occurs only in the walking- 
motion data for |coscu| —> 1, where the bones are viewed at almost right angle 
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by the sensor. We advanced an argument providing an explanation of this 
effect: it is mainly due to the systematic misplacement of the elbow nodes in 
the running motion, where this joint is kept in flexed position throughout the 
gait cycle. Regarding the leg bones, there is no doubt that the original Kinect 
sensor seriously overestimates the femoral-bone length. Both sensors overesti¬ 
mate the tibial-bone lengths for small viewing angles and underestimate them 
for large; discrepancies in the evaluated length of these bones lie in the vicinity 
of 9-11 cm, see Figs. 8 and 9. 

The present work corroborates earlier results [10], obtained with the original 
sensor (the upgraded sensor was not available at the time that study was 
conducted), that Kinect cannot be easily employed in medical/health-relating 
applications requiring high accuracy. In most cases, the results obtained with 
the two sensors disagree with the static measurements and show a large range 
of variation within the gait cycle. The determination and application of cor¬ 
rections, needed in order to suppress these artefacts, comprises an interesting 
research subject. 

Finally, we would like to comment on one misconception in the held of Med¬ 
ical Physics, namely that the results of studies using one subject, or only a 
few subjects, are not reliable. In our opinion, there are studies which call for 
statistics and studies in which statistics is superfluous. When comparing two 
measurement systems and the results (for a few subjects) come out sufficiently 
close, it does make sense to ‘pursue statistics’ and obtain reliable estimates 
of averages, standard deviations, and ranges. On the contrary, there are occa¬ 
sions (in particular, in the validation of measurement systems) where serious 
discrepancies are found in the output already obtained from the first subject. 
Unless one explains why the comparison of the output of the two measurement 
systems failed for that one subject, one subject is sufficient in invalidating the 
application! Even in case that the tested measurement system failed for the 
specific subject (and that it does not fail for the general subject), its valida¬ 
tion must be performed for every future subject separately, to guarantee the 
validity of the output on a case-by-case basis ; of course, this is not the essence 
of validations of measurement systems. 
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Table 1 


The values of the subject’s bone lengths (in mm), obtained from a marker-based 
system (see Section 3). To account for incorrect placement of the markers, an uncer¬ 
tainty of 10 mm is assumed in all cases, save for the femoral-bone lengths, where an 
overall uncertainty of 20 mm is applicable (linear combination of the placement un¬ 
certainty of 10 mm and of a 10-mm uncertainty representing the systematic effects 
of Ref. [9], as discussed in Section 2.2 of Ref. [1]). These values have been verified 
with a non-stretchable tape measure. 


Left humerus 

343 

Right humerus 

344 

Left ulna 

258 

Right ulna 

271 

Left femur 

434 

Right femur 

436 

Left tibia 

434 

Right tibia 

427 
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Table 2 


The average values of the subject’s bone lengths (in mm), separately for the two 
Kinect sensors. 



5 km/h 

8 km/h 

9 km/h 

10 km/h 

11 km/h 

Original Kinect sensor 

Left humerus 

278(23) 

249(22) 

245(16) 

249(16) 

250(14) 

Right humerus 

284.1(6.1) 

239(12) 

240(13) 

244(18) 

247(15) 

Left ulna 

293(18) 

271(27) 

265(29) 

253(24) 

251(26) 

Right ulna 

277(15) 

282(24) 

268(24) 

249(30) 

251(30) 

Left femur 

531(29) 

517(25) 

513(30) 

510(28) 

503(30) 

Right femur 

519(23) 

512(17) 

512(25) 

508(25) 

506(30) 

Left tibia 

403(36) 

431(35) 

436(36) 

440(38) 

443(40) 

Right tibia 

418(30) 

439(35) 

445(37) 

447(37) 

451(36) 

Upgraded Kinect sensor 

Left humerus 

268(20) 

241(10) 

240.3(6.6) 

243.3(7.4) 

244.0(8.0) 

Right humerus 

273(10) 

236(12) 

235(13) 

238(14) 

239(14) 

Left ulna 

248(16) 

253(30) 

255(30) 

244(28) 

241(27) 

Right ulna 

253(11) 

259(33) 

246(30) 

228(21) 

230(23) 

Left femur 

409(42) 

422(33) 

428(39) 

423(41) 

419(40) 

Right femur 

427(35) 

432(28) 

438(34) 

430(38) 

418(41) 

Left tibia 

422(48) 

420(41) 

422(42) 

423(44) 

425(45) 

Right tibia 

409(48) 

411(39) 

415(40) 

418(41) 

428(42) 
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Fig. 1. The coordinate system of the Kinect sensor. The endpoints of the specific 
bone are identified as S and E. The angles 9 and 4> define the orientation of the 
bone in space, whereas the angle u pertains to the Kinect view of the bone; when 
Kinect views the bone at right angle, oj = 0°. The (focal point of the) camera of the 
sensor appears in the figure as point K. 
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Left humerus Right humerus 



Fig. 2. The profile histogram of the humeral-bone length (separately for the left- 
and right-side bones) in |cosw| bins; the data has been captured with the original 
Kinect sensor. In walking motion, the swinging of the right arm of the subject used 
in our data acquisition, is very small. 


16 



















Left humerus Right humerus 



Fig. 3. The profile histogram of the humeral-bone length (separately for the left- 
and right-side bones) in |coscu| bins; the data has been captured with the upgraded 
Kinect sensor. In walking motion, the swinging of the right arm of the subject used 
in our data acquisition, is very small. 
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Left ulna Right ulna 



Fig. 4. The profile histogram of the ulnar-bone length (separately for the left- and 
right-side bones) in |cosw| bins; the data has been captured with the original Kinect 
sensor. In walking motion, the swinging of the right arm of the subject used in our 
data acquisition, is very small. 
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Left ulna Right ulna 



Fig. 5. The profile histogram of the ulnar-bone length (separately for the left- and 
right-side bones) in |cosw| bins; the data has been captured with the upgraded 
Kinect sensor. In walking motion, the swinging of the right arm of the subject used 
in our data acquisition, is very small. 
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Fig. 6. The profile histogram of the femoral-bone length (separately for the left- 
and right-side bones) in |cosw| bins; the data has been captured with the original 
Kinect sensor. 
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Left femur Right femur 



Fig. 7. The profile histogram of the femoral-bone length (separately for the left- 
and right-side bones) in |cosw| bins; the data has been captured with the upgraded 
Kinect sensor. 
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Left tibia Right tibia 



Fig. 8. The profile histogram of the tibial-bone length (separately for the left- and 
right-side bones) in |cos uj\ bins; the data has been captured with the original Kinect 
sensor. 


22 


















Left tibia Right tibia 



Fig. 9. The profile histogram of the tibial-bone length (separately for the left- and 
right-side bones) in |cosw| bins; the data has been captured with the upgraded 
Kinect sensor. 


23 


















